Statistical Entity Ranking with Domain Knowledge
نویسندگان
چکیده
Entity search is a new application meeting either precise or vague requirements from the search engines users. Baidu Cup 2016 Challenge just provided such a chance to tackle the problem of the entity search. We achieved the first place with the average MAP scores on 4 tasks including movie, tvShow, celebrity and restaurant. In this paper, we propose a series of similarity features based on both of the word frequency features and the word semantic features and describe our ranking architecture and experiment details.
منابع مشابه
ESearch: Incorporating Text Corpus and Structured Knowledge for Open Domain Entity Search
The paper introduces an open domain entity search system called ESearch, which aims at finding a list of relevant entities to an open domain entity search query (a natural language question). The system is built on top of a Wikipedia text corpus, as well as the structured DBPedia knowledge base. Entities are initially ranked by a model which effectively associates context matching (based on the...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملRetrieval and Ranking of Semantic Entities for Enterprise Knowledge Management Tasks
We describe a task-sensitive approach to retrieval and ranking of semantic entities, using the domain information available in an enterprise. Our approach utilizes noisy namedentity tagging and document classification, on top of an enterprise search engine, to provide input to a novel ranking metric for each entity retrieved for a task. Retrieval is query-centric, where the user query is the ta...
متن کاملExploiting Multiple Sources for Open-Domain Hypernym Discovery
Hypernym discovery aims to extract such noun pairs that one noun is a hypernym of the other. Most previous methods are based on lexical patterns but perform badly on opendomain data. Other work extracts hypernym relations from encyclopedias but has limited coverage. This paper proposes a simple yet effective distant supervision framework for Chinese open-domain hypernym discovery. Given an enti...
متن کاملProduct Aspect Ranking Using Domain Dependent and Domain Independent Review
In today’s world, internet is the main source of information. There are many blogs and forum sites available where people discuss on different issues and also almost all ecommerce website provide facility to the users to express opinion about their product and services which is important information available on the internet .The problem with this information is that this reviews are mostly not...
متن کامل